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On Slot Allocation for Time-Constrained 
Messages in Dual-Bus Networks 

Ching-Chlh Hah. memdm IEEE, ChacKlu Hou, Member, IEEE, 
and Kang G. Shin. Fii/fiSiv. IEEE 

|«|« T«n»--MAC pfotoc* du»M)« n«^^ 

— ♦ — 

1 Introduction 

' slot contains an Access Control Field (ACF) and a payload. 

. a-a HaiBnd a^l Hou $n mtb the Oepai^d^Btx^ Mngfneer^ ^ ^ j^q^ ^ of particular interest: 

E^/- (ataifiou}^teM^M>^ I) the fcusy bit. which indicates whether or not the slot Is 

Etertomgfo^ Tfte uw*«sfiKrflWid*aa ^|^*;^.tlme bit (or the pf«^arti£rat/on bit), which Indl- 

^l^tf!^ cates whether or not the slot has been pre-assigned by 

M»nmm^0M.i996:^4F^i99t generator to some Isochronous message 

ftrtefofinrtfanortcbtfiniMfwrt^ stieam.and 
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. 3) the Wmia/ cfrcoft kkntUUr (VCI). which indicates to 
which Isochronous stream the slot is assigned If the 
real-time bit Is set. 
Access to the slots Is managed by using these three fields. 
We assume that messages to be transmitted on the buses 
are dhrided into one or more Bxed-length packets/cells, 
and packet/cell size matches the payload size of a slot Le., 
each message cell needs one slot time for its transmls^n. 

The slotted dual4>us network configuration described 
above is general enough to accommodate several MAC 
schemes suggested for this topology, e.g.. DQDB I32K 
Fasnet |371. CRMA Hi), and Single 136), to name a few. 
Under the s)otted dual-bus network conRguradoa wc first 
characterize each isochronous message stream Mj with two 
parameters: relative message deadline and niaximum 
(total) message size C, (measured In packets) that can arrive 
within any time interval of length D^. Second, we formally 
define the slot allocation problem In slotted diial*bus net- 
works, and devise a slot allocation scheme as a solution to 
the problem. Our solution approadi is to devise an on-line 
scheduler that preallocate slots to a set of isochronous mes- 
sage streams {M,= (Cj, 1 1 S i 5 n} in such a way that in 
any time Interval of lengdi D^ there are at least C| slots alk>- 
cated to M| for all f. Based on the scheduler* we then pro- 
pose a slot allocation sdieme that can be readily used by the 
slot generator to generate slot allocation schedules. The 
proposed slot allocation scheme is guaranteed to find a fea- 
sible skit albcation schedule to satisfy the above criterion 
and ensures that all Isodironous messages can be transmitted 
before tfieir deadlines as long as the total message den^ Is" 
less than or equal to a certain threshold, where the total mes- 
sage density Is deBned as the summation of the ratio of 
maximum message size to message deadline over all Iso- 
chronous streams, le^ ZC/D^ Finally, we elaborate on how 
to implement the proposed scheme as a SIotManager 
daemon or a SIotManager chip that resides in the slot 
generator. 

Numerous researchers have studied the slotted dual4>us 
networks in terms of the design of MAC schemes for non- 
real-tlme traffic 136). (37). 141). the fairness issues |8). (21). 
122), 1441. and the queuing performance |9). |101. |34). By 
contrast, only a few of them have focused on slot allocation 
for real*time communication I3S)« 140]« 143). [iSl 146). 
147). (Most of them are In the domain of DQDB networks 
perhaps except for 1461. 147|.) Potter et al. |43| proposed a 



request control scheme to guarantee bandwidth for queue- 
arbitrated access. In combination with a traffic shaping 
mechanism. Martini ct aL )391. f40) proposed a guaranteed 
bandwidth (GBW) * protocol to arblUate the -queue- 
arbitrated slots for the connectkm-oriented services. Both 
schemes proposed in 1391. |40). |43) do not provide any per- 
formance guarantee for rea)-time traffic. Sha et al. |46). (47) 
proposed a global priority scheme to guarantee message 
deadlines and prevent priority inversion. The maior draw- 
back of the scheme Is that It requires a lar^ number of pri- 
ority levels which may not be supported in realistic net- 
works, Chan et al. Ill) proposed a reservation-arbitrated 
access scheme exclusively for isochronous voice transport, 
and achieved statistical multiplexing in isochronous serv- 
ices by allowing voice packets to be occasionally dropped. 
As a result, dieir scheme may not be well-suited for em- 
bedded real-time systems widi hard deadline constraints. 

The scheme proposed by Saha et al. In 145| comes closest 
to ours- However, their scheme differs from ours in formu- 
laUng and solving the problem. They adopt the peak-rate 
message model 17). in which each message stream M| is 
characterized by three parameters: minimum message in- 
ter-arrival time maximum message size C,. and relative 
message deadline D, (D, S PJ. We will show in Section 2 
that their message model is a special case of ours (Le.. more 
restricted than ours). They first devised a bandwidth allo- 
cation scheme based on cydic reservation and derived the 
schedulablliQ^ condition under two assumptions: 

1) message streams are all periodic with periods P, « Dj 
— for all /.and ' 

2) message arrivals are aligned with the starts of alloca- 
tton t^es. and the length I of the allocation cyde 
evenly divides P, for ail i. 

They then relaxed these assumpttons and modified thelr 
scheme to handle the general case. The resulting modified 
scheme may not be able to schedule message-stream sets 
wldi some Di « L in the worst case (see Section 5.1 for more 
details)* In contrast we formulate the problem in a very 
general setting. Le.. we formulate the slot allocation prot>- 
iem in such a way that any stot albcadon scheme that 
solves this proUem should be able to provide deterministic 
deadline guarantees for artatmy Isochronous message 
streams. By "arbitrary.* we mean that the message arrivals 
In an Isochronous stream are not required to be periodic or 
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sqiarated by a minimum Interarrtval time, and the (flrat) 
message arrivals In different streams are not required to be 
in phase (te.. aligned with one another) or aligned with any 
time instant Moreover, the proposed scheme has an easy- 
to-test schedulabiiity condition. Le.. as long as the total 
message density Is less than or equal to a certain 
threshold, the proposed scheme can always find a feasible 
slot allocation schedule. 

The rest of the paper is organized as foltows. In Section 2. 
we describe the MAC specification for real-time traffic used 
in the dual-bus network considered in this paper, discuss 
the message model used to characterize Isochronous 
streams, and define the total message density of a set of 
Isochronous streams. In SccUon 3, we formaUy define the 
slot allocation problem, and discuss how to solve the prob- 
lem by generalizing the results of the phtwheel proUm 1121. 
I13|. 128|, |291, In Section 4, we propose an on-line slot allo- 
cator which takes a set of message streams as the input and 
generates the corresponding slot allocatton schedule in 0(n) 
time per slot allocated, and a slot allocation scheme of 
polynomial time complexity. We also discuss the Imple- 
mentation details of the proposed scheme there. In Section 5, 
we compare our scheme widi that proposed in 145) and 
give a few remarks on possible extension to this work. We 
conchide the paper with Section 6. 

2 MAC Protocol AND Message Model 
2.1 Access Control for Real-Time Traffic 

Many MAC protocols have been devdqjed for non-real-dme 
trafiRc CO ensure lack of stanration and some degree of feimess. 
For examine. Sin^k |36]. Fasnet |37|. and CRMA 141] control 
the access to the bus by use of cycles, and by Imposing a limit 
on the number of slots diat can be used by eadi stadon in a 
cycle DQDB [»1 implements a distributed global queue by 
using two counters, countdiown counter and request counts. 
What Is lacking Is a MAC protocol for ceal-dme trafBc The 
Intent of this paper Is to lay a formal basis of such a protocol 
by devising a slot allocation sd)eme for real-time trafik. 

Before elaborating on the detailed derivation of die pay- 
posed slot allocation scheme, we first give the specification 
for isochronous services based loosely on the IEEE 802.6 
standard 1321. 1331. The dual-bus network uses a preatbitra- 
tion CPA) scheme for Isochronous (real-time) traffic. Each 
isodironous message stream is given a unique VCl. The slot 
generator at die head of the bus Is responsible for reserv- 
ing/marking sufficient empty slots for isochronous mes- 
sage streams by setting 

1) the real-time bits in die slots and 

2) the VCI fields In die slots to die VCIs of appropriate 
isodironous message streams. 

The Stations wltii an isochronous stream Uien watch for 
prcarbltrated (PA) slots wldi the appropriate VCIs and 
transmit their isochronous messages using those slots. If an 
empty slot is not pre-asslgned for isochronous message 
streams' or if die VCl field In an empty prcassigned slot 

1. Unasslgned eins>ty slots af« aftiiUated among slatim 
schemes for non-ftai4lme traffic 



does not match die VCI of any of die Isochronous message 
streams emanating from a station, the station simply trans- 
ports die PA stot downstream. The slot generator must en- 
sure that the slots for each isochronous message stream are 
properly preasslgncd so as to guarantee die timely delivery 
of the messages in each isochronous stream. 

2^ Message Model 

Before deWlng into the issue of allocating network band- 
widdi for messages widi deliv^ deadlines, one must 
specify die traffic characteristics and timing requirements 
of diese messages. Ut M - {M,. ^f^ .... be a set of n 
isochronous message streams (each with a unique VCI) In 
die dual-bus network. Note that for each station, diere may 
be 2CIO. one. or more isochronous message streams ema- 
nating fitom it We use a message model similar to die (r, 7)- 
SiTOOth traffic model 1191. 1201 In which each message stream 
M, Is described by a two-tuple (C,. Dj: 

• C| is the maximum number of packets/cells In M, tfiat 

• can arrive In any time interval of length D| (or. sim- 
ply, the maximum message size of M), and 

• D| is the rpiatfvi? transmission deadline (or simply, the 
deadline) for die messages in M,. i.e.. if a message of 
M| arrives at time t dien it must be transmitted by 
time t<»>D|. 

This model Is, in fact a generalization of two conunonly- 
used real-time traffic models: the peak-rste model f71. and 
die linear bounded model llSj. 1161. In die peak-rate model, 
each stream Mj Is characterized by a triplet (C|. Oi. P^). 
where 

• Pi Is die minimum Interarrival period for i.e., if the 
JIh message of arrives at time t then die {/ + l)th 
message in the stream will arrive at a time no earlier 
dian t ♦ P| for all I (if messages in M, arrive peri- 
odically, then Pi Is the period). 

• C| Is the maximum message size measured in cells in 

Mj, l,e. C} is the number of slots needed to transmit a 
maxtmum-size message in Mj. and 

• D,(^Pj is die ti^nsmtssion deadline for the messages 

InH- 

In the (C, O-smoodi message model we used, the. inter- 
arrival time of two successive messages in M| is not re- 
quired to be larger than or equal to D| (Le.. more than one 
message may arrive In a time Interval of length 5 DJ. How- 
ever, the total message size measured in celb in M, that ar- 
rive in any time interval of lengdi D, should not exceed C,. 
In the peak-rate model, during any time interval of lengtii 
P„ at most one message of size less than or equal to C, will 
arrive. D| S P| implies tfiat the total message size of the mes- 
sages In M, that arrive in any time Interval of lengdi D, Is 
bounded by The peak-rate model is simply a special case 
of the (C. Dj-smooth message model because die peak-rate 
model is more restricted than the (C. I3)-smooth model in 
the sense diat any message stream that satisfies the charac- 
teristic (C|. Df. Pi of die peak-rate model also satisfies the 
characterfsOc (C|. D) of the (C £9-smooth model. 

In the linear bounded model, each message stream M| Is 
characterized by a two-tuple {pp P), where pi Is the maxi- 
mum message arrival rate, and A Is die maximum burst 
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Size, A real-time traffic Is said to follow the linear bounded 
model If the number of cells arrived In any time Imerval of 
length f Is bounded by the linear function Pi - t + /J|. The 
linear bounded model can be Implemented l>y die teaky 
bwtet ISll or token buckH [42| mechanism with a token gen- 
eration rate p,and a bucka size ft. It Is easy to see that by 
letting Ci'^prDi* fii. the linear bounded model becomes a 
special case of the (C. Xfl-smooth model because the linear 
bounded model Is more restricted than the (C. C}-smooth 
model In the sense that any message stream diat satisfies 
the charactedstic (p^ pj of the linear bounded model also 
satisfies the characteristic (C|. DJ » (P/ ' D| + A ^ 
(C D)-smooth model. 

Our slot allocation scheme is designed based on the 
above (C. P)-smooth model and. hence, can also be used 
for message streams that conform to the peak-rate model or 
the linear bounded model However, it Is worth mentioning 
that the peak-rate model describes the "woref case see- 
nario of the (C O-smooth model in the sense that it Is the 
most •difficult" situation for the slot attocation scheme to 
meet the deadline constraints of die messages In M,. Note 
that if each message In M, arrives D, « P| units of time after 
the previous message and with a message size then all 
the C| cells of each message In ^f, must be transmitted 
within D| time units after Its arrival and hence, the slot al- 
location scheme must ensure that there are enough slots 
(ie., at least C, slots) assigned to M| during any time Inter- 
val of lengtit 0^ 

Since data transmission on the dual buses Is slotted, and 
message arrivals may not be aUgned viith slot boundaries, 
we need to express in number of slots, and consider the 
foct that messages may arrive In the middle of a slot Let £, 
denote die lengdi of a slot, and denote die effecUve 
deadline expressed In slot times. £^ can be exfmssed as 

p; = [D,/Ljll. (2.1) 

The floor function and tiie "-1" term In (2.1) come from the 
fort diat D, may not evenly dWide I, and message arrivab 
may not be aligned wldi slot boundaries. For notational 
convenience, we assume In die following discussion dwt D, 
is the effective deadline expressed in number of slots. Note 
that as mentioned earlier; message ceU size matches the 
payload size of a slot, so we may think of Q as measured in 
slots. 

With die (C W smooth message model the notion of 
timing guarantee can be stated as: the slot generator must 
ensure that empty PA slots are properly assigned to each 
isochronous stream M, so that each message In M, is trans- 
mitted within a time period S D, after its arrival as long as 
the maximum (total) message size in any time Interval of 
lengdi D, Is S In odier words, die slot generator roust 
assign at least C, slots to M, between die arrival time and 
the deadline of any message of and because each mes- 
sage In M, may arrive at any time, it also ImpUes that the 
slot generator must assign at least C, slote to M, during any 
time interval of length Df We assume the presence of a 
suitable policing mechanism that marks cells which violate 
die traffic characteristics declared No service Is guaranteed 
to cells marked by the poUdng mechanism. 



2.3 Message Density of Is chronous Traffic 

We define the messap denary of an isochronous stream M, 
as p(M) » C/D^ and die total message density of a set of 
Isochronous streams M ■ (Mj> M| MJ ss 

p(M) = ip(M,) = i§. (2-2) 

Researchers In die field of real-time computing usually 
deftne die sdudulMUty criterion for gMaranteelng a set of 
periodic tasks using some priority-driven preemptive 
scheduUng approach by giving the worst-case achievable 
processor utilization (31 1381. The message density defined 
above Is ^mllar to the processor utilization defined in real- 
time scheduling. In die following discussion, we propose a 
slot aUocation scheme for Isochronous traffic In a dual-bus 
network. Moreover, we give the woret-case achievable total 
message densl^ for the scheme. That is. the slot aUoca- 
tion scheme is guaranteed to find a feasible slot aUocation 
schedule (in die sense diat all messages In M can be trans- 
mitted in time) for any set M of Isochronous streams as long 
as p{M) ^ p^. Note, however, tfiat p»«) > p* docs not neces- 
sarily imply dwt M cannot be feasibly scheduled by the 
proposed stot allocation scheme (more on this vdU be dis- 
cussed later). 

3 The Slot Allocation Problem 

As discussed in Section 2.2. each isochronous message must 
be transmitted wldiln a time period S D, after Its arrival 
Hence, we formally deftne die slot allocation problem as 
foUows. 

PROBLEM I (Stot Allocation Problem). Given a set of isodiro- 
nous message streams M = (M,- (C^ D) \ I ^ nl al- 
locate the slots In such a way that each stream M, Is 
guaranteed to transmit eadi of its messages before the 
message deadline D^ That Is. if a message of M, ar- 
rives at time t entntgh slots must be allocated for M| 
during time Interval |f. f ♦ DJ for the timely transmis- 
sion of die message. 
According to the message model, the maximum size of a 
message in Is C|. Therefore, If a message of M| wldi size 
C, arrives at time U dien the slot allocation scheme must 
aUocate at least Q slots for M, during time Interval It t -¥ Dj, 
Note that the exact time when a message in arrives Is not 
known a priori and message arrivals in different streams do 
not necessarily align with one another. Hence, one way for 
a slot allocation scheme to meet the above timeliness crite- 
rion is to assign at least Q slots to M, for any time Interval of 
lengdi 

Consider, for example. Fig. 2, where four possible slot 
allocation patterns for a soeam M, vrtth C, = 2 and D| = 6 are 
shown. The slot aUocation patterns in Fig. 2a do not satisfy 
the criterion that for any time Interval of length at least 
Q slots are allocated to M^, and a message of M| that arrives at 
time I for 1 ^ t i 5.cann<« meet Itsdellvciy deadUne f ♦ In 
Fig. 2b, the slots are so aUocated that the above criterion Is 
fulfilled, and hence. aU messages can meet tfielr delivery 
deadlines regardless of their arrival times. 
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Fig. 2, RMir possible slot allocalion patterns for message 



A straightforward result on the total message density is 
suted below. 

LEMMA U If the tota! message density of a set streams M is 
larger than I (Le,, > !)• Ctoi no ftasfAfe shtalloca- 
dan schedule exists forM. 

Proof. The lemma follows simply by obs^ng that each 
stream M, (1 ^ I ^ n) must be granted dots at least 
C/Di of the time. Clearly, this cannot be done for all 
streams If p(M) = ^ * ' ° 

In Section 4. we propose a scheme to solve the slot allo- 
cation problem defined above. The theoretical base of the 
proposed slot allocation scheme Is grounded on some of the 
results of the pinwheel problem stated below. 
PROBLEM 2 (The PinwheerProblem) {12]. |13K 128). 129]. 

Given a multiset of n positive integers A « {a,, az 

aj, find an Infinite sequence (schedule) over the sym- 
bols {1.2 n) such that there is at least one symbol 

T within any subsequence of a, consecutive symbols 
(stot^. 

For example, given a multiset A = {2. 4. 5}. one soluUon se- 
quence is (I. 2. 1, 3, 1, 2. L 3. where the subsequence 
(1. 2. 1» 3) repeats forever. In this solution sequence, we can 
find one T in every «i « 2 consecutive syn^ls. one •2" in 
every a^ = 4 consecutive symbols, and (at leasO one -3" In 
every a%^S consecutive symlx>ls. 

There are two Important results of the pinwheel problem 
upon which we will build our slot allocation algoritimi: 

1) If a pinwheel instance A with total density ^ 1 consists 
solely of multiples (U.. a, | aj for all / </ where a, \ % 
denotes a, divides aj^ md p(A) ^ 1 / a, 5 I), then 

A Is schedulable; 

2) If a pinwheel Instance A consists of only two distinct 
numbeis 0.e.. Instances of the fcnn (h h .... 6. C c 4) 
with total densi^ ^ I. then A Is schedulable. 



THEOREM 1 129]. Given a piimbed Instance A » (a,, a^ .... a^, 
Ifaf I aj &r atf l< J. andp(A) ^ 1« ti^ A li schedulaUe 

THEOREM 2 12BI Given a pinwheel Instanced = (6, fc. .... fc c c 
.... c) with p te and « cs. if p(A) « p/t ♦ 1. then A 

Isscberftifaftle. 

Eased on these two results. Chan and Chin |12). (13) de- 
vised several sdiedulers. e.g.. Sa. Sx. Sby. and Sxy, to 
schedule larger classes of pinwheel instances. The basic 
Idea behind Schedulers Sa and Sx is tiie slngte-lni^r reduc- 
tion technique, which alms to transform an arbitrary in- 
stance A to another instance A' = (^.4 con- 
sists solely of multiples and a( ^ a, for all t From Lemma 1 
and Theorem 1. we know tiiat A' can be feasibly scheduled 
If and only if p{AO ^ I. Since a; ^ a, (l.e,. A' is more re- 
stricted than A), if we find a schedule for A\ then the 
schedule also satisfies the original constraints for A. 

Without loss of generaliq^. we can assume that a^^ai^ 
— . £ a« and that the smallest number in A is a. i.e.. a » a,, in 
the following discussion. Scheduler Sa finds an ^ for each 

a, such that 

aJ = a.2^Sa,<a.2^**-2a;. 

for some Int^r J S 0. i.e., < = a - ^'^^^^'^^ (note tfiat all 
logarithms in this paper are to the base 2), This operation is 
called specializing A with respect to [aH, Since the instance 

A' = {a[,4 a;) consists solely of multiples, as long as 

p(A') i 1, by Theorem 1. A' Is schedulable. Sa then uses an 
algorithm. SpedalStagle. proposed In 113} to find a feasible 
schedule for A*. Since a; ^ a^ for all /. tiie schedule found 
for A' Is also a feasible schedule for A. 

Scheduler Sx Is based on the same technique as Schedule 
Sa except that A is specialized widi respect to M. where x is 
an Integer and a,/2 < x 5 a,. Starting from x • aj. Sx special- 
izes A with respect to (x) until x k at/2 * I and chooses an x 
that minimizes p(AO. or until it.finds an x which makes p(A') 
5 I (or until it finds tiiat no such Integer exists). Therefore. Sx 
Is more powerful than Sa In die sense tiiat every pinwheel 
instance that can l>e scheduled by Sa can also be sdieduled 
by Sx. For example, in Sa. A « (4. 7. 8. 13. 24. 28} wltii a total 
density of 0.672 " (« 2/3) is specialized with respect to {4} to 
get A' o (4. 4. 8. 8. 16. 16) wltii a total density of 7/8. In com- 
parison, in Sx. A is spectiOlzied witii respect to (3) to get A' « - 
{3, 6. 6. 12. 24. 24} wltii a total density of 5/6 (< 7/8). 

Schedulers Sby and Sxy are based on the double-integer 
nduction tedinique. and make use of Theorem 2 (as well as 
Theorem I). In general, the double-integer reduction tech- 
nique specializes a pinwheel Instance A with respect to two 
positive integers (6, c}, where h^c<lb and 6 ^ a « a|. That 
is,itflndsana( for each d|€ Asuchthat 



\b 2^ \^b'^f%a^<c ^^ 



2f lfC'2'sa,<t-2^'. 

for some lntegcr7S0.l.e. < = mMb P^^''^Kc-1^'^\ 
Schedulers Sby and ^ differ from each other only In how to 
seiea the two integers frand cfiir the qjedalizatlon operation. 
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Alter the s{>eclallzaUoA <^>efation fs perfbimed. the Sdicd- 
uleis then use an algorithm SpedaiD ubte described in 
il3| to schedule the specialized pinwheel Instance A*. A 
detailed account of all these sdiedulers/algorlthms can be 
found In (121. U3M28i. 

Note that since af 5 a, for all I, we have pCAO ^ p(A). The 
density threshold / of A is then derived In such a way that 
as long as the total density of A Is less than or equal to p* 
then pCAO <> 1 (I.e., A* Is schedulable). In other words, one 
can schedule all pinwheel instances widi densities S 
has been shown in [12|. (13] that the density diresholds for 
Schedulers Sa, Sx. Sby. and Sxy are 0.5. 0.65. 0.6964. and 
0.7. respectively. Note, however, that If a pinwheel Instance 
A has a total densl^ larger than p". It does not necessarily 
Imply that the instance is not schedulable. Instance A can 
be feasibly scheduled as long as the total density of the 
transformed set A' Is less than or equal to 1 . 

In the slot allocation problem. If we require that the slot 
allocation scheme must be able to generate an infinite slot 
allocation sequence such that at least C| slots are allocated 
to M, In any time interval of lengdi D„ It follows that the 
slot allocation problem is a generalization of the pinwheel 
problem (note diat in the pinwheel problem. Q « 1 for all 0. 
Therefore, one plausible a^^roach to the slot allocation 
problem is to view it as the pinwheel problem by consid- 
ering Mt = (Cfr Di as C, copies of In the corresponding 
plnwhed instance. For example, an Instance M = {(2. 5). (3. 7)) 
of the slot allocation problem can be transformed Into the 
pinwheel problem wtth die instance A = {5. 5. 7. 7, 7}. Any 
occurrence of symbol * r or '2" in the pinwheel schedule is 
treated as a slot allocated to M| and any occurrence of sym- 
bol "3." *V or -5- in the pinwheel schedule Is treated as a 
slot allocated to Mj In the slot allocation problem. It 1$ easy 
to see that If the pinwheel schedule is feasible for die pin- 
wheel Instance A, then the corresponding slot allocation 
schedule is also feasible for the Instance M of the slot alio* 
cation problem. However, this approach is not feasible in 
practice, since the method of transforming a stream M| Into 
C, copies of D, In the corre^MXidlng pinwheel Instance 
makes the Input size expand from n to J^* ^ C, (which Is a 
pseudopolynomial expansion). Thus, the schedulers used to 
soWe the pinwheel problem cannot be directly applied to 
the slot allocation problem defined here. 

Moreover, although both the scheduling algorithms 
SpedalSingle and SpedalDouble designed for the pin* 
wheei problem are effective (l.e.. both are parallel fest on- 
line schedulers and need 0(n} hardware |13|. |29)). It Is not 
clear whether or not they can be modified to solve the slot 
allocation problem in which the maximum message size 
1 ^ i ^ ii. can be any positive int^er and arbitrarily large. 
Instead of 1 as in the pinwheel prt4)lem. In Section 4. we 
focus on the single-Integer reduction technique and pro- 
pose a new. simple on-line slot allocadon scheme to handle 
a set of Isochronous streams with arbitrary maximum mes- 
sage sizes. C^. with 0(n) time per slot allocated and 0(l) 



haidware. Note, however, that the double-integer reduction 
technique can also be applied to die proposed slot alloca- 
tion scheme to ftuther Improve the performance. 

4 The SLOT ALLOCATION SCHeAE 

In this sectioa we first describe an on-line slot allocator 
which takes a set of message streamsM » {M|« (C|. D)\\i 
ISn) with JD| I p, for all i</ and p(M) SI as the Input, and 
generates the corresponding slot allocation schedule in 0(n) 
time per slot allocated. We also formally prove the correct- 
ness of the on-line slot allocator. Then, we present the slot 
allocation scheme whidi resides In die slot generator and 
Incorporates the on-line slot allocator to allocate slots for a set 
of general message streams (l.e.. D, | Dj for all / < J may not 
necessarily hold). Finally, we discuss the Issues of Imple- 
menting the slot allocation scheme in a dual-bus network. 

4.1 0n4ine Slot Allocator 

The on-line allocator SlotAllocator OFlg. 3) uses the rate- 
monotonlc (RM) 138| (or. precisely, the deadllne-monotonlc 
(DM)) nde to ass^ message priorities so that the streams 
with tighter deadline constraints get higher priorities. Specifi- 
cally. SlotAllocator treats each message stream M| ^ {Cf. D) as 
a periodic task with execution time Cf and period (» dead- 
line) Dt At any tLme slot. SlotAUocator always assigns the 
slot to the stream with the highest priority among the active 
streams, where an active stream is one >vhose slot require- 
ments are unfulfilled at the current period Interval. Note that 
In SlotAllocator. the for loop from line 5 to line 13 Imple- 
ments dte RM/DM priority assignment rule. The *ir 
statement Qlne 6) and the Bocrfean variable assigned are used 
to locate the hlgtiest-priority stream (le.. the smallest index 
dwt has unfulfilled slot requirement (i.e.. c, > 0) with re- 
spect to die current d^ The SlotManager (line 1) (to be fur- 
ther discussed later) Is die driver to the slot allocator. If 
there Is no message stream with unfulfilled slot require- 
ment (Le., C| « 0 for all 1) with respect to the current d,. 
SlotAllocator will Inform SlotManager that a regular traffic 
(non-real-time) slot shouW be issued (line 14 of StotAllocat r). 
Note that the index 0 in line 14 denotes that the slot should 
be marked as a r^ular traffic slot. 

Let f0 denote the {f • Qth slot assigned to message stream 
Mj. for I 2 / S n and J i 1. In die following theorem, we 
prove that SlotAllocator indeed produces a feasible slot 
allocation for a set of message steams whose deadline con- 
straint multiset consists solely of multiples and whose total 
message density is less than or equal to one. 
THEStxtEMl For a set oflsocbmnoiismcssd^ StreamsM = (Mj^ 
(Cf.D)\l^i^nlifD, I Djforalll<J.andp(M^l, 
then SlotAUocator (which is based on the RM/DM prior- 
ity assignment) will allocate C| slots to M| In any tbne /n- 
tervaJ oflength Dp for all i. 

Proof. It suffices for us to show that 

1) the first C| slots assigned to M| are Oiose slots In 
Interval |1 . DJ. Le.. fa S D^ for att L 

2) ifslot e is assigned to M|. then slot (t-i^ 9 -DJ is also 
assigned to Mj for any tntitgee q such that t+q-Df 
^l.and 
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SlotAllocator 

/• slack time of the cuirent message of H 

/• ec remaining slot requirement w.r.L current d^. v 

/• scndCP. nwsai^: send a message to process P and waU for Its 

recepdont^RV 

/• reoelve(P. luessaig^: wait for and receive a mess^ from 
process P. */ , ^ 

/♦M"{M,«{C^Di I lil^fljisasetoftsochronousmessage 
streams with A I Dj for aU I <i, and p(M) S I. V 

1. receWe^lotManager. M): 

2. for l<- t tondo te*- d|4-I>d 

3. do (/• note that D, 5 13i S ... S Dp •/ 

4. assigned 4- fate 

5. for/«- 1 tondoi 

6. lf(notass^edandCf>0}{ 

7. sendCSlotManagcr. /• assign Ih? current slot to 
message stream M| V 

8. assigned 4- true; 

9. c,*-C|-l: 

10. )/*lf/ 

11. d,4-dl-l; 

12. if(4e»0){c>*-Q<44-D,;) 

13. >/*forV 

14. if (not assi|grie4)said^totMaiiag^.O):/* the current 
will be left as a regular traffic slot V 

15. ) forever 

Fig. 3. The SlotAnocator process. 

3) In anytime Interval of length there are Q slots 
assigned to M,. for ait 1 

We prove conditions 1 and Z by induction on mes- 
sage stream id I Since M, has the tightest deadline D, 
(and. hence, the highest priori^), the q consecutive 
slots from slot 1 to slot Ci will be assigned to M, (i.e.. 
the C| slots do not Interleave with slots assigned to 
otiier message streams). Hence. /„ « C| ^ A (otherwise. 
If C| > D,, p(M) 5 1 does not hold) and condition I Is 
satined for / - 1. From the SlotAllocator process, it is 
easy to see that the n«ct C| consecutive slots assigned 
to M, are slot D| + 1 dirpugh slot D, ♦ C,. Moreover, 
the Jtti set of C| consecutive slots assigned to Jlfi are 
slots (/- 1) • D| 1 tfirough slot (/- I) • A ♦ (Note 
that M, has the highest priority.) Therefore, condition 2 
Is satisfied for I. 

Now. suppose conditions I and 2 hold for all i < ^ 

Since 

I) D,| D|.foral] l^i<i. 

II) f,, ^ D„ for all 1 ^ / < by the induction hypotiiesls 
ofcondltionl.and 

iU) If slot t Is assigned to M,. so Is slot (t^q D^fcx ail 
I S i < 1^ and 4 2 0 by induction hypoUiesls of con- 
dition 2. 

we know that this number of slots assigned to for 1 
i j i k, from slot I to slot Is exactiy C, If con- 
dition 1 Is not true for M*. we have XL^' ^ 
orY* ^ > 1, a contradiction to the assumption that 

p(M)^i« 

Next since fn ^ Ih, from die SlotAllocator process. 



It Is easy to see that the next slot to be assigned to Mi 
Is slot Dt-t- lor law. Since A | D» for I i by in- 
duction hypotiiesis of condition Z. the allocation pat- 
tern fbr the alots assigned to M, . Mw l" ^ 

terval fl, DJ repeats in slot taterval 1) • D| ♦ 1. J- pj. 
for all J > 1. Moreover, the allocation pattern for tfie 
slots assigned to Mi in slot Interval (l. DJ satisfies die 
deadline constraint of M|. and will thus repeat in slot 
interval W- 1) • A I J • all J > l. Note dial 

the message streams. M^i* Mt^j. .... Mg. with looser 
deadlines (and hence, lower priorities) have no effect 
on the allocation pattern of the higher-priority streams 

M,. Ml Hence, condition 2 is true for i= *. 

From conditions 1 and 2. in each slot Interval ((/ - 1) 
. D, 1. J • DJ. diere are (exactiy) C, slots assigned to 
M|. for I ^ I ^ n and J ^ 1. Now, to prove condition 3. 
we need to consider a time Interval of length D| with 
slots t, and ta as die first and die last slots, respec- 
tively, where I ^ t, tji (/♦ I) • for some 
j i 1, and ^ - (t| - 1) » Since the slot allocation 
pattern in dot Interval (t„ J • DJ repeats in slot inter- 
val It,* D^ 0*1) • Di » fe* 1. 1/* 1) • DJ. and since, 
by conditions 1 and 2. the slot allocation pattern in 
slot Interval I/- D,* 1, satisfies the dead- 
line constraint (i.c.. the number of slots in [/ • D, + 1. 
(; * 1) • DJ allocated to Mf is Q. it Is obvious to see 
dtat die slot allocation pattern In time Interval ft,, fj] 
. also satisfies the deadline constraint of M|. I.e.. there 
are Q slots assigned to M| In time Interval U,. tj. □ 

4.2 Slot Allocation Scheme 

We now describe the slot allocation scheme which Incorpo- 
rates SlotAllocator to allocate slots to a set of arbitrary 
streams: SIotManager (Fig. 4) is a driver to tiie slot allocator 
process SlotAllocator. and VCIServer Is responsible for 
mapping a message stream Id to its corresponding VCI 
number. 

To allocate slots for a set of general message streams M. 
SIotManager performs the following steps. 

Step 1. Upon system initialization. gather/mainUin the 
required Information regardlitg the connection requests 

SIotManager 

/•AssumcM«{M|-(C^i I iS/Srt.wheie^Sfti-iD^V 

/* Upon system initialization */ 

I. collect and update M; 

2 spedalizethedeadlinecanstFatatmuidsetDofMtoeetD'tMT, 

3. If (p(MO > 1) M and exit: 

4. else { 

5. send(SlotAllocator.M'): 

6. do { 

7. receive(SlotAllocator. 4: 

8. If (i*0) 

9. Inquire die VCIServer for stream M/s VCI and nil 
dte VCI field of die current slot wlUi stream Mfi VCI: 

10. dse 

II. tag the current slot as a regular traffic slot; 

1 2. wait for dte "time slot* to ftdly elapse: 

13. }forever 
14.1 

Fig. 4. The SlolManaear process. 
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from the Isochronous message streams, especially (C> Di, 
for each stream M, and the source and desttnadon sta- 
UonldsofM^ 

Step 2. Spedallze D » (D,. D^, .... DJ using the chosen spe- 
dailzatlon operation to get the speclaliztd message stream 
set M' with the specialized deadline constraint multiset 

ty = {Df.DS Fo*" example, if the spedalteation 

operation used is the same as that used In Scheduter Sx. 
then we And a for each D| that satisfies 

for some Integer J S 0. where x is an integer e (D,/2» PA 
that results in the minimum p(MO. (Note that £^|I)J. for 
alli</) 

Step 3. Check if the total message density of the specialized 
stream set M' « {M; = (C,.pp|l ^ /s; fi> is less than or 

equal to 1. i.e. if p(MO = C,/p; i 1 . If not. reject M 
and stop. Otherwise, proceed to the next step. 
Step 4. Use the on-line allocator. SlotAUocator. to obtain 
the message stream id llf i > 0. assign the slot to M| by 
filling the dot's VCl field with M/s VCL If i « 0. mark the 
slot as a r^lar traffic slot Note that SlotAUocator will 
generate an infinite sequence of message stream ids foe M' 
such that there are C| copies of message stream M/s id 
wi^ any subsequence of Di) message stream ids. 
Step S. Walt for the time slot to fully elapse, and repeat Step 4^ 

for the next slot. 
Example. Consider a set of isochronous message streams, 
M = {{1, 4). 0. 7), (2. 13). (1. 23). (3. 28)). Specializing 
the deadline constraint multiset D = {4. 7. 13. 23. 28} 
with respect to <3> yields D' = {3, 6. 12. 12. 24). Since 
D^|i:);.foralli<7,and 

P(M') = X =^ 1/3 + V6 + 2712 + 1/12 ♦ V24 21/24 < I. 

by Theorem 3. SlotAllocator can generate a feasible 
sdiedule for M*. By applying SlotAllocator. we ob* 
Uln the slot allocation schedule shown in Fig. 5 in 
which the sequence repeats every » 24 slots. Note 
that because of the RM/DM stheduling property, a 
slot is always assigned to a message stream with the 



tightest deadline among all active streams (which 
have unfulfilled slot requirements with respect to 
their current deadlines). As one can readily see. there 
are at least C| slots assigned to M| in any time interval 
. of length pf D) toe all J, 

4.3 Impfemantatipn of the Slot Allocation Scheme 

The slot allocation scheme can l)e implemented either as a 
SlotManager daemon ^ig. 4) or as a SlotManager chip 
(Fig. 6) that, resides in the slot generator. (Note that the 
hardware chip performs exactly the same functions as the 
SlotKfanager daemon.) The advantage of hardware Imple- 
mentation is speed. It needs only 0(1) time, instead of 0(n). 
to find the Index of the hl^iest-prlority aalve message 
stream and allocate a slot However, the maximum number 
of connections Message streams), n. that can coexist in ^the 
dual-bus network is limited by the number of modules in 
SlotAllocator used to find the highest-priority active 
stream^ 

In order to set up. maintain, and disconnect connections for 
message streams. SlotManager, SlotAllocator, and VCIServer 
co-reside In the slot generator station of each bus. SotMan- 
ager considers each time*constrained cormection between a 
sourceKlestination pair as an isochronous message stream, and 
gathers/maintains all required information r^^arding the ac- 
dve connecdons. especially (C^ and the source and desti- 
nation stations* Ids' for each message stream H 

At system initialization. SlotManager gathers the con- 
nection information. Invokes SlotAllocator to generate the 
' (virtually) infinite sequence of message stream ids with 
which SlotManager allocates slots 0>y setUng the PA bits 
and filling the slots with appropriate VCls). In order to set 
up a new connection, the source station sends a call setup 
request to SlotManager. specifying hA^^ (Ca<^ D«J. Upon 
receiving the request SlotManager specializes the aug- 
mented deadline constraint multiset D U and checks if 
the total message density after specialization is less than or 
equal to one. If not the request for establishing the connec- 
tion Is rejected Otficrwise. SlotManager notifies VCIServer 
of the new cormection which in turn assigns a new VCl num- 
ber for die connection. SlotManager notifies StotAllocat r 
and the source and destination stations of the acceptance of 

2. Thatls. Che building block xoomed out in Fig. 6l 
1 Ibe source and destination station ids are coOeoied for the purpose 0^ 
stot tcioe. We ¥411 brteOy discuss slot reuse In Section 5Z 
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the new connection. SlotAllocator then reestablishes the 
corresponding new schedule. An existing connection can be 
cleared (disconnected) cither by the source or by the desti- 
nation station. The call clear request is s^t to SlotManager 
which responds simply by notifying SlotAllocator to delete 
the corresponding message stream and tag the slots origi- 
nally assigned to the message stream as regular traffic slou. 
The interested reader U referred to [301 for a detailed ac- 
count of how to dynamically establish or terminate a real- 
time message connection. 

5 Comparison WITH Rei^teo Work AND 
Remarks 

5.1 Comparison with Related Work 
As mentioned In Section 1, Uie slot allocation scheme pro- 
posed by Saha et aL In |4S] comes closest to ours but differs 
in formulating and soh^g the problem. This difference has 
yielded a signlAcant consequence explained below. 

They adopted the commonly-used peak-rate message 
model which Is. as discussed in Section 2.2. a ^>eciai case of 
the message model we used in this paper. 

They ftm devised a slot allocation scheme based on cy- 
clic reservaUon and derived the schedulabtlity condition 
under two assumptions: 

1) message streams are all periodic with periods Pi ^ D^, 
foralUand 

2) message arrivals are aligned with the beginnings of 
allocation cycles, and the length L of the allocaUon qf- 
cle evenly divides Pf for all /. 

They derived the schedulabillty condition as *if 




then the set of message streams Is schedulable, where 
= is the average slot requirement of M, per alloca- 
tion cycle.** (Note that L Is the length of the allocation cy- 
cle and P/L is the number of allocation cycles In a time 
period P|.) Then, in order to handle the fact that messages 
may arrive In the middle of an allocation cycle and thus 
may not utilize the allocation <^cle. diey modlHed R| as 
= C,/((f5 - L)/L) = LC,/(i; - L). In order to relax the as- 

sumption that P| « they subsequently modified* 12^ as 
C|/((0| - IJ/U e IC|/(D, - L) . The achedulabUlty con- 
dition then becomes 

o a if* 

Now. if there exists some such that £>» I. then in most 
cases the schedulability condition does not hold, wtiich im- 
plies that their scheme cannot guarantee to find a feasible slot 
allocation schedule for such a message stream set. Moreover. 

it Is also difficult to choose an af^ropriate L value If GCD(Dt, 

Di Is small. Actually, the schedulabillty condition 

could lead to an erroneous result if L Is larger than 0|. 
for some i (in which case J^!^ < 0. One such example is 

M « {(C|. D|. P^ II » 1. 2} » {(3. 4. 9. (3. 5. 5)) and L » 6. *rheir 
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sdtedu!abHiOr condition yidds + ^ « -27 < 6: how- 
ever, the mess^e stream set Is not schedulable (see Lemma I 
In Section 3). In contrast our scheme can definitely find a 
feasible allocation schedule as long as the total message 

density. Xm^'/^'* ^ ^ ^ * certain den- 

sity threshold (e.g.. 0.65 If the specialization ofwatlon used 
Is the same as ttiat used in Scheduler SjO. Actually, our 
scheme can find a feasible allocation schedule for a message 
stream set as long as the total densl^ of the message stream 
set after spedalizaUon Is less than or equal to one. 

Another shortcoming of their slot allocation scheme is 
that their scheme requires that the message arrival times 
are known to the slot manager a priori, which is only true 
for strictly periodic streams. Therefore, their scheme cannot 
be used to guarantee message deadlines fbr streams that 
conform to the (C D)-sroooth message model, but arc not 
periodic, ff^ote. however, that they also proposed a more 
complicated Implementation which can handle both peri- 
odic and sporadic streams and uses the queue arbitration 
function defined in the DQDB standard. The use of queue 
aibitratlon function is out of the scope of this paper.) 

S.2 Remarks 

In this section, we compare the slot allocation problem with 
the distance-constrained scheduling problem described in 
1231. 125). We claim that the solution schedule to the slot 
allocation proUem generated by SlotManager can also be 
used as a schedule for the discrete-time version of tfie dis- 
tajKe<onstraiMd task system (DCTS) 1231, 1251. In die DCTS 
model, two consecutive executions of the same task must be 
well-spaced and "close* to each other. Specifically, given a 

dlstance^onstralned task set T ^ (T^. T| T^. where each 

cask T| has an executton time d and a (temporal) distance 
constraint D,. if fy denotes the finish time of tiie Jlh execu- 
tion/invocation of task Tj, then the dlsUnce constraint D, 
for T, requires that fn ^ A and fy^, -f^^Dt, for all J ^ I. On 
contrast. In the traditionafreal-kime task model 1381. every 
task must be executed once during a certain fixed period. 
The execution of a task in one period is independent of tiie 
execution of the same task in any other period.) 

Now consider a distance-constrained task set T In which 
both C, and D, are int^ers. for all L From the proof of Theo- 
rem 3. it is easy to see that a feasible schedule produced by 
the proposed slot allocation scheme. SlotManager. is also a 
feasible schedule for tiie task set T. For example, tiie slot 
allocation sdiedule In Fig. 5 is a feasible schedule for a dls- 
tance-consualned task set T « {(C,. I><) 1 1 ^ / ^ 5} = {(1. 4). 
(1, 7). (2, 13). (1. 23). (3, 28)) (it Is easy to check that fn ^ A 
and fy^, - /^^S A. for I S 5 and I). 

6 Conclusion 

We have proposed a slot allocation scheme for allocating 
bandwidth and guaranteeing die timely deUvecy of tiie 
messages In isochronous (real-time) streams in a slotted 
dual-bus network. The duai-bus network configuration Is 
general enough to acconunodate several MAC protocols 
suggested for tiUs topology. e.g.. DQDB 1321. Fasnet |37|, 
CRMA HU. and SSiaple 136]. The (C £9-anootii message 



model used in tills paper Is mart general, and Indudes tiie 
peak-rate model and the linear-bounded model as special 
cases. The proposed sk>t allocation scheme uses an on-line stot 
aHocatkm algoritiim of polynomial-time complexity (whose 
correctness Is rigorously proved). The resulting scheme Is 
guaranteed to find a feasible slot aUocatton schedule for a set 
of message streams as long as the total message dendty cS the 
set Is less tiian or equal to a certain density tiueshoki For ex- 
ample, currently, tiie best density tiireshdd Is 0.7 forthedou- 
ble-lnt^ reduction speclalizatim operatioa 

The message density threshold actually serves as a met- 
ric for evaluating the predictability and stability of an allo- 
cation sdieme. By predictability and stability, we mean that 
given any set of isochronous message streams, as long as its 
total message density is less than or equal to the derived 
threshold, a feasible slot alk)cation schedule is guaranteed 
to be found by the proposed slot aUocation scheme. Moreo- 
ver, a message stream can be fireely added to. or deleted 
from, tiie given message set as long as the total message 
density is held below the tiireshold. This complements the 
work done by Uu and Layland (Ml in deriving the proces- 
sor utilization bounds for scheduling computation tasks in 
a single CPU environment. 

We have also addressed the Implemenution Issues of the 
proposed slot allocation scheme. We configure all the func- 
tionalities Into three modules: SlotManager, SlotAllocator, 
and VCIServer. all of which can be implemented as soft- 
ware daemons, or on a VLSI chip. Botii implementations 
. are simple and practical. 

The performance On terms of tiie nuntber of message 
streams that can be scheduled) of the proposed slot allocation 
scheme can be Improved by using the concept of slot reuse 
|17|. 1311, 1351. 1481. 1491. 1501. That Is. tfie cell tiiat has passed 
on to its destination can be taken out of the slot in order to 
lefease the skit for reuse by downstream stations. For example, 
^ven that ttie sources and the destlnatkms.of two message 
streams (ccmnection^ and are stations Nf and Ny and 
Nf and N^. respectively. If < Nf ^ AfJ < N/. tiiese two 
connections are actually spatially nonintersecting. and. hence, 
can use die same virtual connection. Le.. use all the prearbi- 
tcaled slots asdgned to tiie same VO. We have studied tiie 
pioblon of grouping spatially nonintersecting message 
streams into stream subsets. All the message streams In a 
stream subset use all tiie prearfaltrated slots witii the same 
VCIs for message transmission. The Interested reader is re- 
ferred to |24| for a detaUed account 
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A Pipeline-Based Approach for Maximal-Sized 
Matching Scheduling in Input-Buffered Switches 

Eiji Oki, Member. lEEE^ Roberto Rojas-Cessa. Student Member, IEEE, and H. Jonathan Chao, Fellow. IEEE 



Ahstraei—Th\M letter proposes ao innovative pipeline-based 
ouiKlniaMized roaUhfns scbedoliag approach, called PNfM, 
for input-bttfTered switches. It dramatically relaxes the dming 
constraint for arbitration with a maximal matching scheme. In 
the FMM approach, arbitraUoa operates in a pipelined manner. 
Each subscfaeduler Is allowed to take more than one time slot for 
its matching. Every Hme slot, one of them provides the matching 
result* The subscheduler can adopt a pre-existing efficient 
round-robin-based maximal matching algorfthnEi. We show that 
PMM provides 100% throughput under uniform trafGc since it 
preserves a dcsynchronlzation effect of the round-nbin pointers as 
In the proexisting algorithm. In addltiOQ. PMM maintains fairness 
for best-effort traffic due to the roand*roblo*based arbitration. 

imUx r«nns— Inpiit-buirered switch, maxImal-dzBd matching* 
pipelinei scheduling. 



I, iKTRODUCnON 

THE EXPLOSION of Internet traffic has led to a greater 
need for high-speed switches and routers diat have over 
I -Tbit/s throughput Most high-speed packet switching systems 
adopt a fixed-size cell in the switch fabric. Vuiable-length 
packets are segmented into several fixed-sized cells when 
they arrive, are switched duough the switch &bric, and are 
reassembled into packets before they depart. 

There are various types of buifering strategies in switch 
architectures: input buffering, output buffering, or crosspoint 
buffering (7]. Input buffering is a cost-effective approach for 
high-speed switches. This is because tr^>ut-buffercd switches 
do not require internal speedup or allocate any buffers, at each 
crosspoint It relaxes memoiy-bandwidUi and memoiy-size 
constraints. 

In input-buffered switdics, the Vutual-OuQwt-Queue (VOQ) 
structure is used to overcome a pipblem associated with First-In- 
Fust-Out (FIFO) input queueing, which is called Head-Of-Line 
(HOL) blockingproblem P]. For a crossbar switch with VOQ's. 
maximum-sized matching algorithms have been proposed to 
achieve 100% throughput (4], but they suffer from high-com- 
puting complexity. Therefore, it is difficult to implement such 
aigoriduns for high-speed switching systems. 
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Maximal-sized matching algorithms have been proposed as 
an alternative to maximum matching ones, such as tSLlP [3] 
and Dual Round-Robin Matching (DRRM) [1], [2]. Bodi algo- 
rithms reduce their computing complexity compared with max- 
imum matching ones, and provide 100% diroughput under uni- 
form traffic and complete fairness for best-effort iraffic How- 
ever, they still have a strict constraint diat the maximal matching 
has to be completed within one cell time slot. The constraint 
is a bottleneck when die switch size increases or a port speed 
becomes high (e.g., 40 Gbit/s), because the arbitration time be- 
comes longer than one time slot or die time slot shrinks, respec- 
tively. 

To relax the strict sdieduling timing constraint, a 
pipeline-based scheduling algorithm called round-robin 
greedy scheduling (RRGS) was proposed by Smiljanic ei aL 
[5], Each input has only to perform one round-robin arbitration 
within one time slot to select one VOQ. RRGS avoids output 
contention by allocating more than one request to the same 
output into different time slots in a simple pre-determined cyclic 
manner. However, RRGS is not able to provide max-nnin fair 
share for a best-effort service. This is due to the simple cyclic 
mechanism. When traffic is not balanced, some inputs can 
unfairly send more cells than others. Smiljanic also proposed 
weigbted-RRCS (WRRCS), which guarantees prereserved 
bandwidth [6]. In WRRGS, however, the feimess problem is 
not yet solved for best-effort traffic. 

It is a challenge to find a maximal matching scheduling 
scheme to meet the following requirements: 

• scheduling time relaxed into more than one time slot; 

• high duoughpul provided; 

• fairness maintained for best-effott traffic. 

This letter presents a solution to these requirements. We 
introduce an iimovative Pipeline-based approach for Max- 
imal-sized Matching scheduling in input-bufTeted switches, 
called PMM. Within the scheduler, more than one subscheduler 
operate in a pipdined manner. Each subscheduler is allowed 
to take more than one time slot. Every time slot, one of them 
provides die matching result. The subschedulere can adopt a 
pre-existing round-robin-based maximal matohing algorithm 
such as iSLIP and DRRM, while preserving ihc properties of 
the original nonpipelined version. As a result* PMM imvides 
100% throu^iput uiuler imifbrm traffic since a desynchro- 
nization effect of the round-robin pointers is preserved. In 
addition, PMM maintains fairness for best-effort traffic due 
to the round-robin-based arbitration while RRGS adopts die 
piedeterministic cyclic selection rule. 
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Fig.t. Timing diagnmofPMMwhfalGs 3. 

II. PMM 

A. Switch Mode! 

Consider an K » N' input-buffered switch with VOQ's at the 
inputs and a crossbar switch febric. A fixed-size cell is sent from 
any Input to any output, provided that no moire than one cell is 
sent from the same input and no more than one cell is received 
by the same output Each input i has N VOQ's, each of which is 
denoted as VOQ(i, i), where cells that are destined for ou^iut 'j. 
are stored. The HOL cell in each VOQcan be selected for trans- 
mission across the switch in each time slot Tbeitfbre, every 
time slot, a scheduler has to determine one set of matching 

B. PMM 

The PMM scheme is able to relax the computation time for 
maxinoal-sized matching into more dian one time slot. A main 
scheduler consists of request counters and K. subsdiedulers. 
Each subsdiedulerhas '10 request flags. Each subscheduler op- 
erates the maximal-sized matdiing in a pipelined manner, as 
shown in Fig. 1. Each scheduler takes Id time slots to complete 
the matching. In each subsdseduler, we assume that one of the 
pre-existing maximal matching algorithms, DRRM, is adopted 
to simplify the description below, otherwise stated." 

Several notations used here are defined in the following. A re- 
quest counter 5^) is associated with VOQ((i, 53- The value 
of iRC(», 5) is denoted as C(t, 'dS. vi^erc 0 < C7^t, 1;,) 2 JCw- 
LoBut is the maximum VOQ occupancy. C{t, g) expresses the 
number of accumulated requests associated widi VOQ(i, ^ 
that have not been sent to any subschedulers. Each request flag 
BF{i, h *) is associated witfi VOQ(i, j) and subscheduler *, 
where 0 < Jb < liT * 1. The value of Rr(t, j, k) is denoted 
as F(t,i7fc), where 0 < F(i,i,fc) < 1. F(f,j,fc) = li 
means that iiqsut i has a request to output j in subscheduler k. 
F{i, J, h) = 0 means that input i has no request to output j m 
subscheduler ik. At initial time, j.) and F(t, j, A:) are set 
to zero. 

PMM operates as follows. 

• Phase I: When a new cell enters VOQfi, ?/), the counter 
value of JlC!(il, is increased: j) = ©(i, j) 1 

• />Aa5e 2.* At the beginning ofeveiy time slot t, if C^, S9 >s 
OandiE(i.g,fc) = 0, where * = t moS !K(, C^t, 5) = 

j) - V and ar, k) = t. Otherwise, Gft, S) and 
F(i> j, k) are not changed. 

• Phase 3: At ICi ^: k < t » Kil +: 1) m k, where / 
is an integer, subscheduler k operates the maximal-sized 
matching according to the adopted algorithm. 

' PMM described beic is tiso aUe to adopi other mu-oun lUr stive slgo- 
cHhfns such as tSLIP [3]. 
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• Phase 4: By the end of every time slot i. subscheduler k^ 
where* = (t-(iir.-i)) mod Aj, completes the matching. 
When input-output pair (i, b) is matched, F(», J, k) = 
iF(i, a, «) - 1. In this case, the HOL cell in VOQ(i, j) is 
sent to output 17. at tiie next time slot > When input-output 
pair (i, 5) is not niatchcd, 5, k) is not changed. 
Whenever a condition associated with any phase is satisfied, the 
phase is executed by die main scheduler. 

To apply the DRRM algorithm as a matching algorithm in 
subscheduler k in PMM, we use !F(t, '% k) instead of VOQ re- 
quests as described in the DRRM scheme [I], [2]. Each sub- 
scheduler has its own round-robin pointers. The pointers in sut>- 
scheduler H are influenced by the results only from subsched- 
uler k^ not from other subschedulers. The opmition of DRRM 
in subscheduler ft is the same as diat of die nonpipeiined DRRM 
sdieme. 

III. PERfOKMAKCe 

This section describes duoughput and delay performance 
of PMM under uniform traflic, and die £umess for best-effort 
traffic* 

A Throughput 

PMM that adopts the DRRM algoridim in the subschedulers 
provides 100% throughput under uniform traffic. 

The reason is as follows. Consider die input load as 1.0. If 
some inputs cannot seiui cells, outstanding requests are main- 
tained in each subscheduler. In odier words, Oi A) — ^- As 
a result, C!(i, SF^ is not always dcocmented in phase 2 and in- 
creased in phase 1 . Since G(ni, ^} reaches a large enou^ value 
to be always satisfied widi 02(4, 5) >. 0, 5, ft) = 1 is 
kq)t in phase 2? In diis situation widi IF:(^, 1^ ft) = I at any 
t in phase 3, subscheduler ft diat adopts die DRRM algoridun 
provides a complete matching result every !K; time slot due to 
the desynchronization effect of input and output arbiters, as de- 
scribed in [2].^ 

Thus, PMM preserves die throughput advantage of DRRM. 

As explained in [3], DRRM and iiSLIP do not provide 
100% diroughput for nonuniform traffic widiout increasing 
the internal switching capacity. PMM also preserves this 
characteristic which can be overcome in die same way as for 
die preexisting schemes. 

B, Delay 

The scheduling time K does not significantly degrade delay 
performance, as shown in Fig. 2. In this evaluation, we include 
absolute delay caused by die scheduling time in the delay per- 
formance. When *Ki increases, requests from iRC7{i, iO are dis- 
tributed to 'AE(», Sr'i ft) associated with each subscheduler ft. 
Therefore, die desynchronization effect becomes less efiicient 
widi Ki at die light traffic load. At die heavy traffic load, die 

^nus «xsuret that cells fhmi the oine VOQ are transim 

\VL(x, o\ - C(i, 3i > 2 . when: ir'O, j].) is the occupancy of VOQ(r. j). Note 

that x<r. 5f) - c(i:n = EJS)* n^* 5. 

^Although £(»! h ^) becomes 0 in phase 4 when a request is granted, 

Jj Q isahfvayachangedtol tnphtte2. 
<VU^ note that the pointer desyiichfQiiizttion is achieved wi 
cchednier. There is norelationslup of poiateia among dHTefcnt subsehedateta. 



mi 0 at: A PfPEUNE-BASED APi1UVtf3t roR 



265 



10000 




Offered load 

Fig. 2. Detoydependency on scheduling time Ki(switchsitt 

delay dependency on K becomes negligible. Therefore, Kiocs 
not affect delay performance for a practical use. 

The delay performance is in^roved with more iterations. 
Since PMM relaxes the scheduling timing oonsfiraint, a large 
number of iterations is not a bottleneck even when the switch 
size Increases or a port speed becomes fast, compared with 
the noi^>ipeiined algorithm. Note that we showed the delay 
performance with one and four iterations because there is no 
measurable improvement with more ttian four iterations. 

C. Fairness 

Since we adopt a round-robin-based algorithm in subsched* 
uler A;, subscheduler can maintain a fair scheduling. Therefore, 
PMM provides max-min &ir share for best*effoit liafific. 



IV. COHCXUSIOMS 

This letter prq)0sed a PMM scheduling approach for input- 
buffered switches. It dramatically relaxes die scheduling timing 
constraint Eadi subscheduler is allowed to take more than one 
time slot Every dme slot one of them provides die matching re- 
sults. The subscheduler can adopt a pre-existing efficient max- 
imal matching algoridim such as iiSLIP or DRRM. PMM max- 
imizes the efficiency of the adopted arbitration scheme by al- 
lowing sufficient time for a number of iterations. We showed 
that PMM provides 100% throughput under uniform traffic and 
keeps &imess for best-effort service as the preexisting algo- 
ridundoes. 
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A Hard Real-Time Bus Arbitration Protocol based on £IAr709 
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%bs6*ac< 

Fieid ^ea neiworla ^eidbussesj- afl^^ 
ana pfcqtpUaSwhs.A WpfSiff^nT'pi^^ 

process atdornation^_hard (( 

^ans lUie cfy. LpnWorh J^ 

709) wtticH is'^r^ used for home and hujlding 

automdiion. ' Cdmpanies developing ^ LgnWoHi prtMcS, 
are starting to evaluate the possibilities of using the same 
technology for highspeed and reaMme applicgUons in 
order to avoid the overitead' of several d^er^, 
communication protocols. This paper describes, the 
development and lmpimentatipn pf g hard reql'time biis 
titration schmeforiHeHigK^ 
^Lchip^ which prjocfi^ 
£IA'T09 protgcpritjf^ 
herein combines ^pptimm 
hierimc^'^gi^^ 



t: Introduction . 

In all 'fields : of Iactwpfiq5g7^^ 
applicatioib '^reguifing ..-cithir soft J/xir-lSidf^^^ 
operations. Since jh_ " juatqtr^onl'" technology;., 
decentralized . j^proach' S^*"^^ vMri '-^i more' 
ttistalLadons, " rcal:tiinc,.;^nc^S^^ "j-J^oomji^' 
iziipoiibiit issue. TTie^djfferencc bcf^ 
real-time systems" ' becriV*<Ui^u^ 
publications, e.g'. X i ]. Crucial for tiie desig^ of a haid real- 
&he system is abovie' ali* the pr^dictiBlnh'Q^ under certain 
conditions. This detenninistic behavior combined with a 
mechanistn to guarantee the adherence to haid deadlines 
(100 %) under the stated constiaints separates the lianl 
real4ime behavior of a system, from ..a soft lealTtinie 
system, where even under most ideal coi^tiotis only a 
probability of coxrcct behavior Q?a be 'speciBca.;^^.? 
n^woik systeoii' real^^ne b^yipf mo^.iro ib' 
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tlw Ji^uii5tci37^^^ "data 

^^missjqk '^n^ t^inf r^-^ 

juirlutecliure; pves 'lhe gTOt' advanta'g^ jhe 

piidiisibili^ 5^ conr^iex distributed sysl^^ 

FpT. i^^n^i>n ;!^^l9©^ <*lffe«^l network 
piptixqtis'e^ catiegohz^ ^'iReld area 

netw)^ks'4^^ ^jplication areas 

are: 

• hpnie and btulding automation, 

• multimedia, consumer electronics, 

• proqess'andlndustn autpmatitb', 

• fuiraaft ;(fly4>^ 'im<l . aulpmdtiyc (drive-by-wirc) 

• in^dKcal'^andscienfific insiniineniatibn. 

to i^Uce 
tpMrds. ri^-lime 

iMiati^fliJ^fp??^^ XlHPx. which 

Se mSnlyjus^ ^^^.-^^^^^^^P^^^^^ 

gin^Sn^Ificl^iSIp)^^ bmlding 

QSriMgtd ^l'sfitc K. Is- a y«y flj^SBjjf^^u^pn w^^^ 
6^^yiirall"igeas^'of lufe The 

fijnn^6^w3f coiiVehi^ .basdd 
on gfc^8?^5iLnAi!TO tjft 
ti^ stendi^fdia^tionTori^^ 

ANSI^IA^W^^^ ENyi3i54-2 PI, 

con^>anies and researcli institutions are free to develop 
and integrate more powerful implementations of the 
standard. 

A main develppmeht in this context is the. L-Chip, a 
network controller from the Austitan company LOYTEC 
for processing the lower three layers of the ANSI/EIA- 
709.1.protpcor(8]. The Jiigher protocol layers as well as 
thc .»bae .appUc^^ CPU.. This way the 

li^stlSm^^ of thi'prol 
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hard real-time' aibttration scheme presented hqtin. aie 
to^lemented in hardware. The host CPU is scabble and 
can be selected according to the application reqiiiirements. 
Protocol modifications in higher OSI layers (4-7) can be 
incremented in software allowing a very short design 
cycle time. The target is to be able to create flexible id^ 
6peed'BA-7()i9 nodes lyhich can offer higher perfomiance 
and at the same time interoperability with conventional' 
LonWorks networks. Due to. its implementatiph .^cture 
and the additional ihtq^tibn of a special ubltratipiTmo^^^ 
presented, "in'ihis pajwr,. the. L-Oup, can be used to build 
hard tc^I-fimc lEIA-7p9\lcc« 
sy^ems. 

2.TeaiirreSs.I^^^ 

The inaih . featuie&'of We !fi^ r«al^timel.a%r^ 
mode as well as the/cpmii^ons. nece^^ 
ml-tiroe n^mrments aie^^]6nefly'jdfs<^ b«l2M(!-.TBS 
load and fault; h>Tp^te^ 
works, correctly, can eaaly be.^ derived. /frpnT 
coiidifions. 

2*l.:Arbltratf OA Protocol Architei^^^ 

Hie EIA-709 protocol is based on. a bus stracnire 
(broadcast bbhavidr) and uses a bus arbioration mechanism 
called "predictive p-perdstent CSMA with optional 
priority**. Althou^ -collisions are. rediic^ed compiired id 
standard CSMA (Cirrier Sense Midtiple Access), there is 
no guarantee that a collision is resolved, withixi ,a certain 
period'of tiine. The^ai^r^^ tak^ l^r dMiThard ral-tiin^ 
mode, of the LrCpp wig^ tb '<feyelpp "^ colli^ton-less 
protocol. It b ba^im'l^^^ 
ElA-769 .Si ibp^i^pn^^^ 
additsiqioijy^^o^^ 

2^.PrcdicfabiIiih^anaH« ^^ 

Ris iniportantlfqr predi^^ility.and re^'lft^ 
that ti&e reqtureme^^^ 

time: The h^ real-time linikle of tfie IXSiip guarantees a 
lower bound for data tfaro||^ut as wdl as . an upper 
bound for d>e frame dehty of each nodcTeyen in 'fiill load 
situations. The load hypotheses in tiiis context would be ja 
worst-case scenario of all nodes on the bus trying to send 
fiill q>eed at the same time. Despite the foot that a priority 
scheme is used to determine the order m which the bus.is 
assigned, to .the transmitting nodes, fiforvation of low 
priority iu>des is prevented by' thie protocoE 

ATO.tfaa* .uniH>ri^^ issue of haM i^-titiiejs^tem^^ is 
the startiip befovior. Using ' ma$ter/noii<master 
disdhctfon. .deterimhisidc.bdu^ can. .be achieved liy 



specifying a single master in the system which starts the 
node synchrbnizatioa by sending otit the first frame. This 
way collisions can be avoided even if all nodes are 
powered up at die same time. 

2J.Flexibflity 

There is always a badeoff between predictability and 
flexibility of ft.sy$.tenii. sc.c.[l], Chapter 7, The number of 
n6des*lsigether.>i^ fa^dwidth aligned to. each nod^ 
jgig^ jiuffl W^^ctcd"^^ the required, n^t^ 
packctllelay tBiiic^'^Howevcrr^wth U^^ mode 
of JiL J^Chffi *^S^ify. JMi. individual 

fii axtmum '^ pacicet liMt ph ^d' hence" baiidwidth) f^r each 
5ode;;'!^ "3^i]a>Tia^ ^iiipofC bNetween 

maximim.p^et^leng^^^ tuiie. Further* nodes 

9!^!ilIMisconne^D^ f^ repjurj-' at 
aiy time while flie''netwofk is operatmg.' 

2.4.I%rfoimince 

Since fficfTirCHijOP^ tipruSJfixed"tinw^s^^ 
frame trahsmi»iion, flTere is very .Uttle^'protqcol oVCTbead 
assigni^. 'tpVlM.syhc&^^ The typical 

frame delay .is much snialler than the maximum delay in 
load situations. Tlie bandwidth utilization of Ae network 
media is always optimized automatically. If e.g. only two 
nodes have concurrent transmit requests at a time, the 
compfeie available bandwidth is assigned to these two 
nodes. 

2.5^ IniteroperabUi^ 

Sj^' P^Hp ba sed^iiiq^ ^ 
fiUiyn^itttcrcyerablrCci^j^^^^^ 

jwithTIA'709 r^'^i"n ip>i) re^tjib^ 

gjodcjiw^-tfe j(^nH)a^^^ based nodes 

P? J^^^JS^MM^jchi^^^ by 

tb diicmg^the N euron CSSs to .priority 

'^EB'IransmiSaonTTHbwe^' ffif 

]iB^^.!noinK star^pn cahiK>t.t»e irvoidol 

iiiT16adi»tc£tiiuis. 

2«6. Fault Tt^lerance 

A cmcial point ii) hard real-tinie systems is friult 
iplerance. It is not acceptable fliat the real-time abilities of 
the network are dependent on single nodes (single point of 
failure)..As' already mentioned, the hard real-time protocol 
descnb^ herein toloates hon-working (silent) nodes as. 
well as nodes* beirig rertM>Ved during operation. When e.g. 
the. node, with the /highest .priority, foils^ to sciid the 
isyhchfonization sig^nals (tem[K>nuily or permanently), a 
different node automaticaliy takes over and . ensures 
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wires to s^i^Ue jiiii^Sii^ 

didnselves. Howwcr,Jimccltf^ 

of eadb no^ 8~(Kfecfive nbde^^wniiiw^ 
firanies with the wrong fiintng could' .rono^ 
bcQccblock-Uie ne^orlL.This mswis thafTthe ai^tniSbD 
protocol Is not fault folerant to so called "babbling idiots".' 

2.7. Physical Network Media 

The physical channel in EIA-709 is nol fixed so that 
there are many transcdver options such as twisted pair, 
power line, radio frequency and even fiber optics. To 
allow the usage of already existing channels (mostly 
twisted pair or power liiie), no addition^ wiring ^11 be 
necessary to iiiu>leinent.&e i^-tim'ep^ 
the 'arbitiatibh scheme is thi^t bus adivify and ioactfyit^ 
can be detected correctly by all^ nodes at a^'tfiofte. To be 
more exact,, each node miist; detect ti^'l^gih^^hig ji^^ 
end . ' of ' each firaihc . on the cha^iici; . including 
synduonization frames/ AdditidnaUy, ni^^ 
the media during |dlc time mii^. i^^^ tbTfil^ly 
detected" fijmacs "sii^ tjusjcbiild jpreye^^ tK>dc f "from 
soididg out Its fr^ne pnd tKs^wy'' mfruiip^ 
deadliiae. Fi^er/ ^e.p&jii^ .n^<^^^ 
the nunimum S3^j^N>xuzs^OT~^q^^ 
ffiitepropaptidn^pgM'bf M^ 

For a l)etter understanding, and'as an !mVbductfon to 
the. bard real-time, aibi^tion scheme, the bus arbitration 
mechanism of the EIA*709 protocol sbould be 
investigated first (Figure 1), 

tae I Staldlt 't 2 3 , n'l 2 3 16.8L Bittldle 

Figure 1. EIA7709;arbiitralioii Qrcle. 

As" shown * m7Figwr;"U t&e.'w called j^iime 
tm^ediateiy foltpvi^la firaiiMs.tnn^MCb^'.OT Ihe 
^24ime jC8mIl« as. anrinfer firaine' y^^lg 

scparaie ^.^'fiaiSes ^n^ jno 
transmissioii by an}^odes'u^'anowe^^ 

A numbS* bf: < cquS^sgcir pro sJots'^priBg 
duration '.follow the ftlij^ntiyynng cohfi^w ragdnJo C 
ttie nctwwk eacl^nc^ cfw^ !^^!^'ib:bn^^ 
one br the n'piibiily "slotsVira^ 
prionty packet, the lesultiiig frtone is froasnuned on *^e 
media staiting at* tliCbegmrimg.of tfielfode^Li^b^ sl^ 



jAJl.btfc»r yall.detect the busy, state 

of ife" cn filia a iid Tiive t6 waitlmtii ffi^bus gets idle'again 
*b'!^^'jtR. ji$iltari<^^ shown jn'Figu^'T. It Ts 
bbwus'tiiira a'pripfi^^ slot i has a 

h>ig}lir linfS&31^M P!^^ < / is 

iTp avoid' ihat a high pribrity node A. blocks.die whole 
nelwork dive to c^ntiiiuos semting of priority packets, a 
ribde must in general not send two consecutive frames in 
its priority slot but must send the second frame in a 
randomizing slot (explained below). If a differeot node B 
sends a frame on the network before node A accesses the 
biis to transmit (be second fiaine, node A is allowed to 
retry to send the second frame in its priority slot after the 
frame from node B h transmitted This niechanism was 
intended' to provide an acceptable tradeoff between 
minihiizing the access tithe on the bus for priority packets 
on one hand.and avoiding blocking of Uie network due to 
continuous l^ccess* of hig^^ on the other 

luuid.lHowever, sdi^tne dcK^ irat prevent two hijjti 
prugty nqdw network by 

lyterna^lX^ 

gddiM~;^li6^^ but; 
icanliilsoltn^simiL fi^ .'in. lonel. of. !the * so called 
^n SSSSom SSS^jlS^' wKch"foH6w*tKe ii^piimity slots/ Tl^ 
SSratmlof one and is 

^^Witolhe'd^ slot tsu^.' IfJa rhbde A 

^'^ai^&J^ s^d^^ a stpty out of 

/dj^oqiSlXidis^^ <o 
ggg^.^iRUi|e-^il^^ If 
SLdiJ^erentr^ode iff picte .a slot i .toTsend. a frame in the 
same lubim^ / <y, node A has to wait until 

tiie'^^ b^ ge^ idle a^uii to retry the transmission in the next 
arbitration cycle. If both nodes -A. and B select the same 
slpi (r» 7)ihe Iraiiies will collide pn the media. Destroyed 
frknes are detected on OS! layer 2 due to an incbrrect 
CRC (Cyclic Redundancy Check). Both nodes A and B 
have to repeat their transmissions, hopefully picking 
different slots i for the next arbitration cycle. 

The vahie BL is the so called ^'channel backlog** which 
is' adjusted to the estimated network traffic of the. near 
fri&re to mininuze' collisions due to a Ugb bandwiddi 
ufilization.on..one!hand and to minimize Idle times when 
IfiETiiefvvork. .iSrafiic. is low- on the ' other band, it can be 
gniined /iMj^r tinje the current' value of BL is 
consistent: in 7airn^^ 6n*the,n^brk.. The described 
cubitiatipn^lscKi^^ is. niijiied'''*pWi p-persistcnt 
CSKM"_CC§m« Multip.le>cccss); 

irJJsfiSiidr. beT .mcfddbned-l.di^ 1 :most LonWorks 
BjlpJi^SbSJare'pnly malqng jisejof tli^ 
(no pri'oritif slots 'e^^'on jhe nctWprk<b^ cbr^'guririg 
HiTparametg «rOOp7pS^ 
ajgj^thm'is welllsui'ted 'for 'the main application area of 
CdhWbrferw^^^ au tomation. However, ance 

g^slpins^ tlit^.nibiUa/an^,dK 
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caa not be eosiucd. This is because of the possibility of 
multiple (infimte for the worst case) ooilisions, one 
occurring after the other. No frames could be tranonitted 
correctly in such a situation. 

At the first view it looks, like, collisions csui be avoided 
if all nodes onjhe network ]m^o'nl/.im ]pri^tO!ot£! 
This assumption is coarrect.fqr cdniinuoi» netw^ 
However, if there is no fafljc for a'fiahe 



:(i) 



the nodes ' on "the "iictwi*" a^^ 
anymore. That is,.no.pripnty slots e?us't and "foFfgtKer 
transmission a node siinply vraits'f^ rando^^dela^^ 
transmit request 'fipm .the'.lipper pipto^l layeTs' i^^^ 
granted by the MAC Q^!^}^. Acoe^~.Coi#>l) byer. 
Hence, collisions can occur ^ although priority sIo& are; 
usedl MorvSiver,' stsffvatibn^ii possiblej^d^^ 
priority nod^ bloddng tK^ network ^'alrb^y explained 
above. 

4. Basic Concept of a Hard Real-Time Bus 
Arbitratidn Protocol 

To allow. timeH;^tt.cal applications using EIA-70% the 
MAC layer has to be" idkpted to perfiMrrn. re^-time 
operation. Hiat is, a certairi niaximum time ' initerval 
bcftVeeh a fransfmt jre^ a'pijcetjaml tte^^^^ 
transmisaon must ]^'guarmte©drTKe U-Oi$/d^eloped 
by the AusMw' con^a^ CQYTEC. based ' <m rw^wcH 
ddi»*yrtlMfel^^ 

inaprei»mts''Js^^ . JoNil-^^3^^"i^ti9^^&^ JJ^l^^^ 
coiicqit is"fe' utiliz e ''tKr jmorit y'llo^ ^ 
desCTibed'ih^^Chapi 
ei^ nodes ooimectM^^^ 



!«— t 1 1 " 1 

1 MUb U 2 } 4 5 « 7 -t> MMW < r.'STJ 4 S ( T. J * 
I I t t t J I I I t M < L > f t M 



AfthikA 



Figure 2. Time slots fdr 'dkta transmlsdon and 
iB^chronizatibn in the reaf^tfine o^de 

If a node A associated with slot / f 4 requests to send a 
packet it has' to waft* for its priority slot.- lif a node B 
associated with slot i < 4 is already traiismittihg a firaine, 
node A cannot send in Jhis. cycle. After. a n^^fm 
transmitted % fianie,'all'n<}^ leturii to'tlfe 6^^^ 
the cycle sbown in Figure 2. This way all nodes are 
synchronized after each frame and cqllisjons.are avoided 
If no node wants to send', every nqck'.waW ufitil |the 'gine 
for the priority slots is elap^. After that Ad ndde which 
is associated to slot' 1 lu^,^ s^ 8X^^^>!S£^pa 



ftame in order to syndinmize all nodes for the next cycle. 
If this node cannot send the synchronization finame due to 
ibm^ &ilure, the node associated to slot 2 has to send it 
instead and so on. This way the time slot procedure stays 
valid . even if .some nodes have crashed or have been 
i^i^y^ at sqioM poir^^^^^ 

jQ6e/probiem left is that ife.g^ the node associated to 
slpt' CWidd no other node 

wouldljEKaThave a' chance* to .transmit. . To avoid this 
scenario^ a' node, can only send a new fiame when it has 
received at lo^^one s^chroniz^on fnuhe, meaning. that 
for onebyctefnni .9^^ has transrhined.^H^ every 

node ge^^^ ^enlf all odier 

iuxfiShave COTCunraJ 

-ifii7dier~Jl^bl^^ " rcft .lsf . fficl .^.s^^ of the 
synchromzati<m''cy^^^ after power-up. For this 

purpose/a dedicated master node is intrbduced/The master 
noctriL^^ 9^ .^lo^^l^.ll^^ oinworlc^&H is allowed to 
tnoisinjit a fiipD^Vuutiiilly, ste|ting ffie 'syiiclironizatio& 
cycleifter a r^et All other nodies have to receive at least 
one fiame before they are allowed to access the media 
after power-q> in order to avoid collisions. To ensure a 
correct startup behavior even iii die case of malftinction in 
the master node, more than one node in the netw^ can 
be configured as a master. Each master node waits for a 
raiidom delay , after power-up until the first fiame is 
traiisini^d 'td/avpid t>i^ mas^ colliding tipbn 

transmitting the first fhuhe. In the case of multiple masters 
@!^i^Y^oil!^li^c startiip.bd^ real:, 
Biii^fTnfo "Sft. redt45ic"^.^'d^^ fo^ .tbe'ppsObiU^^ of; 
CjijysijwisV^ fir^J5f¥0C..^er ppwir-up/JTte^ is, 'an 

yKe:protd<»rd«cnbed^abw^ 'aissuirw cp 
jBnsmsmi ssidn .-of '" frorosT" "•However^ . cycn\; . jf the 
synchiOTizati OTlc ^^ '^is TotIoto reasoh - eg" due to 
^J(^<^r'aj^ .Qocur,. liodes aie^blc to r^- 

S)^^^ronize ^^^'j^ ^/!?¥pJ<i^PiPX^ frames which 
have at jeajt' 'the SMC the regular 

s>na^iroma^ tke.|*hani'real-ltime'* 
^^stem would' sl^ Ifiim iiito.l "j^^ system, 
which, means tfaat'diere is .a 'probal>ility"that after a* 
cehi^ tiiiie firames get traittmitted correctly. 

S: Compafibility witli Convehtioiial EIA-709 
Nodes 

In principle the real-time ^mechanism described in 
Chapter 4 o^y works for IrCtdps vMch implement the 
^edicated'aiintir^oh scheih^ i<dthough convenddnal EIA- 
709 iuxfes (Neuron Chips) can also use pribnty slots, the 
foi jovtilig is^ to be kept in inind: 
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• Frames which are .meant to be sent m .mi&mizing slots 
can not be' transmitted due to the sjnacEronccation cygle. 
Besides, . &a98mis$i{>hs. in'.iiindotp. $(p|s. jm^ . t^i. to 
collisions vAidi ^ not be allowed, for the real-time 
mode. 

• In the ElA-709 MAC layer the pioblism of starvation is 
solved using a procedure (switching to transmission in 
random slots after each priority packet) which difTers 
from the discussed reaNttme mode. This means that for 
Neuron Chips working in. the real-time environment 
setup by LChips, starvation can not be avoided. 

The chosen impleihentaiion'only si^pprts priority slots 
of Neuron Clnps wth "ijie ^wfaaclc of the pos sibility of. 
starvation. THe . rdiuffihg 'ffiis^^ transmission c^le ls 
^wn in Figirc 3. 



1^ 



I- 



* 1""^ \ 1 

t t I I I t t I I , , , I 1 f I > I t I i 



raoHiirSldtt 



Figure 3. Neuron Chips ih tlie reai^jne ^yironmeht 

Wheii iM^L^ in rcal-tiTOl modc^ 
transmit a pai6kct,'*the conhe<^ NetuppfCH^^^^ 
allowed to send , due to dieir fiJ-tmc. However,, bii? 
activity is mooitdred by Neuron Chips after tlieir ^2-time; 
Hence, L-Chip transihission^ wiU always Se detected'^ 
iong as the transmission exceeds the" end of th^ ^/vtimc.: 
Like described in Chaptw- 4, there always. M to be one 
cycle without an L^CHip being allowed to sendr.evCT if all 
L-Oiips lutve pending transmtssions/Jh thb 
Neuron Chip with.Uie.hipest prioi^J^^ 
slot) tlifll wants to s^ihd is diov/ed tio transmit fts'lraura^ 

6. Response Time 

The real-time ;Cpnccpt just ^ d^CTbed " enstgK_ i 
nuuumum respcmse jimE.li^^ 
the iictual .stari orUle/trarismisab^^^^ 
calculated MJibe worst case^wKich'^.meaixs? 

• The node' in questm v^ts to Iwnd tw paccfe with7nj? 
idle time mbetweeii; 

• AlTbther^nbdes gef p^ requests to tmn^t s^ly 
after the hode in question has transmitted its first fianie. 

• All nodes want to send pack^ of n^aximum t^hgdi. 

FunhernKxre, the iresponse time depends on: 



• the fittximuih ftame transmission, dme (r/>and)t 

• the transmission time of a s^chronizatibn frame (/j^, 

• the. number of real-time L-Chip nodes >yhich is the 
number of priority slots (n), 

• the priority slot number of the node in question (0. with 
fin the range of 1..II. 

The time between the first and the second frame of the 
node has to be calculated from the values /j to as 
described below. 

Thejime ii tdkesfir aU piher nodes to transmit their 

firgmesXtO'. 

T his is 'because pf iKeassuiMMn that all other nodes 
fe^^^pTftansniit^^ bfefori: tfi^'n6d^ in qu^tiolOnodb* 0 
aSdlhura rc illow^ to'tammit s«e (2).' 



: • (2) 



Now thif ni^ traigniittedr^h pf them ha^ 
to, wait for one cycle wiffiimt tranm see (3). It is 
assu^^d* t&t; node Pserkte* 'tfi^* synclupnizsBtipn* frame 
meaning it bias rid fiiither iransiiiit request yet. Otherwise 
it could already send its next frame. Further, the response 
time for/node r must be caM^ted difTerentiy since, as 
loehtidiSd above, tfils node can always transniit a>rejgular 
frame in. the syhchrotiization cycle. Therefore all 
fdtto wing calculations coiirespond to ai node number 
*' r tim HI> l)V 



(3) 



foQSXiOOdso assur^ pat no other nporfeal-time 
noi^i^eQrm;]^^ cycle,' 
s^'Fig^.?!^?Ftius was ife case, it wo^d be pd»ible that 
t)\^isS^'6s^ci^^ cii the djowcd frame transmission 
GS^b!^n]^•6me.n^E^. 

mmUer ^'ff tot^^ 

Afltor''the:s^chR^^ 'noldcs with liigher 

prionty; thui tl^'nade m questiqn^can ag^in fransmit. their 
franiiwiisee (4).* 



^3 ^hftimi). 



C4) 



ithe^/-ttme(/^;)» 

» die widdi of the priority slots (/aJ^ 
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THejmengdei has /o iwt/}b7/l(i /iTOn 
TUs is the' tiine betv^h the'^.of the. last^ 

nodes witE 'Iug^ pnonty to'tbe stirt pr 

fifom node/; see XS)V 



(5) 



This leads to the response time for nodes in the range 
of2..ii,see(6). 



tfyfpfi:^ 2..nJ = fi + 0 + /, + ^ = ... 

+ / + + j 



(6) 



For node 1 .the response time only consists of (2) and 
the first part of (3), see (7). 



... = (« + /).f^/+^^j.lI./5,«+(ll-7).J 



ifchfiiiiei' tfiirf influ^TM 
numbierof nodes: 

i::: The 'gf W^ ^^/G«^cK jaep^^^ 

sfots, scie Figwe j) ^ Nei^^ 
2... .The maxlinum frame .transmissicm. time (Mvt ^hiph 
has to be chosen careftilly since it is a main factor of 
the response tiitiel However; in many hard re»al-time 
systems ohiy short ntessages Qito e.g. "open valve'O 
are needed. 

3. The priority slot time (qm. which must be long enough 
to ensure synchronization amongst all nodes' in the 
network. Heiice, the tninimum possible slot time is 
dependent oh .the node> clock accuracy as well # the 
fietwqrk si^' &d\!the Sgnid'pr6p^tignt;spe^^ 



Tibtell » Wprat case respopse'tuiii^ TiJ^V^. f^^7^ 
f Mbp^ chWiikid with' n « 16,- tstnt^'Ll.^^^Mf^ 
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5.24 ms 
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Table I gives an exan^>le of the worst case response 
times tit^ for a 78 kbps and a 1^5 Mbps channel, which 
are often used for LonWorks. 

{7. AppUcfitioD Example 

One ppssible a^Ucation ar^ for the current 
imprmemaUon the hard real-time niode of the IX^hip 
is. the .usage for; ba^^^ GIA*7Q9 trnUti* 

port lnetwrk,>>wtch'.."L^wte^ con^jany 
LOYTEC :f9V a^^ gucE a' Jiigh 

twia^ |»ir cfgE^din'^te^-time riiiode. see' Figiffe 47By 

j^i^RD^em^^j 




Figure 4. Collisionless backbone 

In this case, the . main goal was. to raise the 
predictability of the backbone and especially avoid 
starvation of switches. In a load scenario, where traffic is 
gtmirdicd from all . switches on the badcbone at the same 
;^me,, the .wbgreddon j>rpt that the .available 

bahj3^ tt^ switches and 

Ijsq §^s^i^ W3t^fJ^o^ fer tfae deliay time of each 
P^.?R^9p!C^^^b'.l^^ swi&h'ing 

8* Cdnclusidn 

j Slh is'papCT T hafd^r ^ bus arbitration protocol 
b ask^T^ V^SI/ElX-709 'w^ After haying 

SuUfiga tliilfetags gffl'wSQifijjetenfe of ffie'profocol .^e 
iwdrtif case" res pond the'given .CK>nsAtnts 

twe'dfeny^';^^ exainple. 
■^^speds^ pf Infi^^era^iU^ wiOi' cpnveiitiohai ;EIA-709 
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